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I. INTRODUCTION 

One of important developments in nonequilibrium statisticphysics in the past two decades is the discovery of a 
variety of fluctuation theorems (FTs) or fluctuation relations [If 0, 0, II IE, If3, 0,0,0, El El I3- These theorems were 
usually expressed as exact equalities about statistics of entropy production or dissipated work in dissipated systems. 
In near-equilibrium region, these FTs reduce the fluctuation-dissipation theorems (FDTs ) fl3l . [LH ] . Hence they are also 
regarded as nonperturbative extensions of the FDTs in far-from equilibrium region 
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gous to many new 

findings in physics, the mathematic techniques for proofing these theorems have be en p resent for many decades. For 
instance, thanks to the works of Lebowitz and Sphon [4|, and Hummer and Szabo [16j |. we know that, in Markovian 
stochastic dynamics these FTs have an very intimate connection with the Kolmogorov backward equation (1931) 
and the applications of the famous Feynman-Kac formula [13; [3 (1948) and Girsanov formula 0, Ho] (I960). The 
involvement of the backward equation or more precisely, its perturbed versions in deriving the FTs is not occasional. 
Previous many works haveproved that various FTs originate from the symmetry-breaking of time reversal in dissipated 

systems 0, 0, 0, H II i, 0, S, 0, El ED, El, HI . This p° int 

is now widely accepted and reader may reference an excellent 
synthesis from this point of view by Chetrite and Gawedzki [22]. Intriguingly, the backward equation concerns about, 
at future time given a state or a subset, how system evolves in it from a past time. Namely, the backward equation 
is a final value problem, and can be evaluated backward in time from future to past. Hence, the backward rather 
than the forward equation or Fokkcr-Planck equation is natural tool to describe time reversal. Actually, this idea 
has been im plie d earlier in finding conditions for the detailed balance principle of homogeneous Markov stochastic 
systems [23l |24|. In this work, we roughly call a discussion on the basis of past time backward to distinguish more 
conventional discussion on the basis of future time (forward). 

Although thes FTs are of importance and extensive attention was paid on them in past two decade, there were 
fewer works concerning about this connection during a long time. The reasons may be two sides. On one hand, 
physicists are not very familiar with the backward equation compared with Fokker-Plank equation. Introduction 
about the backward equation in many classic books [23|, [24| was usually about its equivalence with forward equation. 
Its application is solely first passage time or exit problems. On the other hand, as mentioned perviously, time reversal 
is very relevant to the FTs. Most of theorems could be evaluated by the ratio of probability densities of observing 
a stochastic trajectory and its reverse in a stochastic system and its time reversal, respectively [H, 0, 0, [25|. Hence 
physicists familiar with quantum physics may favor the direct path integral approach [26l . [27| . Until recently, some 
works began to investigate and exploit the connection between the FTs and the backward equation [2^, IH, [29|, H(| • 
For instance, Ge and Jiang (28j employed a perturbed backward equation and Feynman-Kac formula to reinvestigate 
Hummer and Szabao's earlier derivation [lj| about the celebrated Jarzynski equality [1, 0] from mathematical rigors. 
A generalized multidimensional version of the equality was obtained. On the basis of an abstract time reversal 
argument, Chetrite and Gawedzki (22| established an exact fluctuation relation between the perturbed Markovian 
generator of forward process and the generator of time-reversed process, though the authors did not use perturbed 
backward equations explicitly. Inspired by Ge and Jiang's idea, we obtained two time-invariable integral identities 
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for very general discrete jump and diffusion process, respectively [29|, |30(. Considering that several transient integral 
fluctuation theorems @, @, [13, [TT|, HU are their path integral representations in specific cases, we called these two 
integral identities generalized integral fluctuation theorems (GIFTs). Our further analysis showed that these GIFTs 
had well-defined time reversal explanations that are consistent with those achieved by Chetrite and Gawedzki [22| . 
Hence, their detailed versions or the transient detailed fluctuation theorems (DFTs) should be easily established. In 
addition to simplicity in evaluations, to us the most impressive point of using perturbed backward equations is that a 
specific time reversal is defined naturally and explicitly given a specific IFT, and the latter can be designed "freely" 
from the GIFTs. This apparently contrasts with conventional direct path integral approach (including Ref. 22]), 
which requires a specific time reversal first and then obtains a specific IFT. Previous works showed the definition of 
time reversal may be nontrivial, e.g. that in Hatano-Sasa equality (l2l |. 

The aims of this work are two-fold. First, we attempt to present a comprehensive version of our previous work 
about continuous diffusion process [29]. In addition that many details that were missed or very briefly reported 
previously will be made up, which mainly includes classification of the existing IFTs and time reversals and derivation 
of the transient DFTs from a point of view of the GIFT, we also present several new theoretical results. The 
most significant progress is to find that the time-invariable integral identity we obtained previously is a generalized 
Chapman-Kolmogorov equation in general diffusion processes; the path integral representation of the well-known 
Chapman-Kolmogorov equation may be regarded as the first IFT. Additionally, we uniformly obtain the GIFT for 
the Smoluchowski |3l| and Kramers type [321 ] diffusions by employing a limited Girsanov formula (see Appendix [A"]) . 
In previous works [J, |22| the latter was considered individually. Our second aim is to show that there is an alternative 
way using the backward equation to derive the classical linear response theory [Tj| [l4j| , and a simple extension of this 
"lost" approach results into the transient FTs found almost forty years later. Although it is widely accepted that 
the FTs reduce to the linear response theory when they are approximated linearly near equilibrium [T|, [j, ITU |22| , 
one may see a significant difference between their derivations: in books [23| the linear response theory always starts 
from an evaluation of probability distribution function using time-dependent perturbation theory, whereas the former 
did not use this function at all. We show this differences may be obviously diminished if one employs the backward 
equation to evaluate the linear responses of perturbed systems at the very beginning. Moreover, this reevaluation 
evokes our attention to the importance of the Chapman-Kolmogorov equation. We are tempting to think whether 
the dominated forward idea using the forward equation postpones the findings of the transient FTs in Markovian 
stochastic dynamics. 

The organization of this work is as following. We first present some essential elements about the continuous 
diffusion process in sec. [TXJ The Chapman-Kolmogorov equation, Fcynman-Kac and Girsanov formulas are explained. 
In sec. IIII1 we derive the linear response theory using the backward equation. Two FDTs that recently attracted 
considerable interest are also discussed briefly. Section HV1 mainly devotes the GIFT, which includes the relationship 
between the GIFT and the generalized Chapman-Kolmogorov equation, time reversal explanation of the GIFT, and 
classification of the IFTs and time reversals in the literature from a point of view of the GIFT. Additionally, we 
also propose a Girsanov equality and explain differences between this equality and the GIFT. In sec. [V] we derive 
the detailed version of the GIFT on the basis of its the time reversal explanation. We summarize our conclusions in 
sec. ED 



II. ELEMENTS OF STOCHASTIC DIFFUSION PROCESS 



We consider a general A-dimension stochastic system x={xi}, i=l, ■ ■ ■ . 
equation (SDE) [24j| 

d-x.it) = A(x, t)dt + B2 ( x , t)dW(t), 



N described by a stochastic differential 



(1) 



where dW is an A-dimensional Wiener process, A={Ai} denotes a A-dimensional drift vector, and B 1 / 2 is the square 
root of a NxN semipositivc definite and symmetric diffusion matrix B 



B 



D 





where D isaMxM (M<N) positive definite submatrix. We call a stochastic process Smoluchowski (nondegenerate) 
type for M=N, and Kramers (degenerate) type otherwise, because the Smoluchowski and Kramers equations [3~D . 
are their typical examples. One usually converts the SDE into two equivalent partial differential equations of transition 
probability density p(x,t|x',£') (t > t'): the forward or Fokker-Planck equation 



(2) 
rate) 



d t p = C(x,t)p = {-d Xi Ai(x,t) + ^d Xi d Xl B u (x,t)}p, 



(3) 
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and the Kolmogorov backward equation 

d t ,p = -C + (x',l/)p = -[A i (x',t)d < + ^Bu(x',t)d < d lol }p. (4) 

The initial and final conditions of them are <5(x — x'), respectively. We follow Ito's convention for the SDE and 
use Einstein's summation convention throughout this work unless explicitly stated. The forward equation defines a 
probability current 3[p(x, £)], components of which are 

Ji[p(x,t)} = Ai(x,t)p - ^d Xl [Bu(x,t)p], and £(x, t)p(x, t) = -d Xi J t [p(x, t)]. (5) 

Different from the forward equation, Eq. (j4|) is about past time t', and generally p(x, i|x', t') does not have a probability 
interpretation with respect to variable x'. The connection between the forward and backward equations may be seen 
from the famous Chapman-Kolmogorov equation [24| 

( o(x 2 ,t 2 |xi,ii) = J dx ( o(x 2 ,t 2 |x, t)p(x, t\x!,ti), (6) 

where ti<t<t2- An equivalent expression is its derivative with respect to time t, 

= d t [J dx/9(x 2 ,< 2 |x,t)p(x,t|xi,ti)] 

= J dx[d t p(x 2 , t 2 |x, t)]p(x, t\xt + p(x 2 ,t 2 \x, t) [d t p(x, t\xi ,h)]. (7) 

The reason of the left hand side vanishing is very obvious. Equation ([7]) implies the operators C and C + are adjoint each 
other if one substitutes the time-derivatives on the right hand side with forward and backward equations. Conversely, 
through the same equation we can as well obtain the backward (forward) equation using the adjoint characteristic of 
the operators if known the forward (backward) equation first. 

There are two famous formulas in stochastic theory that are employed in this work. One is the Fcynman-Kac 
formula, which was originally found by Feynman in quantum mechanics 17 1 and extended by Kac fl8j in stochastic 
process. Assuming a partial differential equation 

d t >u(x, t') = -£+(x', t')u(x, t') - g(x, t')u(x, t'), (8) 

with a final condition u(x, t) = q(x), then its solution has a path integral representation given by 

u(x, = x <*' (exp[ f .g(x(r), r)dr}q[x(t)}). (9) 



where the expectation x,t ( ) is an average over all trajectories {x(r)} determined by SDE |T]) taken conditioned on 
x(t') = x. Letting g=0 and q(x) be a 5-function, the Feynman-Kac formula also gives a path integral representation 
of backward equation (0} . The other is the Girsanov formula . Roughly speaking, the standard version of this 
formula is about probability densities of observing the same trajectory {x(t)} between time £o and t in two different 
stochastic systems: Assuming they have the same nondegenerate diffusion matrix B [=D in Eq. |(5J)] and one of them 
(denoted by prime) differs from the other only in the drift vector, A\ — Ai + a^, then the probability densities V' and 
V are related by 

V'[{x{t)}\ = V[{x(r)}]e- f ?° TC [ a K-,xM)dT (10) 

and 

K[a] = -OifB-^uai - ^(B" 1 )^^ - A t ), (11) 

where Vi=dxi/dr and the integral is defined by Ito stochastic integral. The inverse of the diffusion matrix above 
indicates the indispensability of the nondegenerate characteristic of these diffusions. Nevertheless, degenerate cases 
are more generic in real physical models, e.g., the Kramers equation [32l |. After recalled the original evaluation of the 
Girsanov formula, we find a limited version specifically aiming at the degenerate diffusions; see Appendix [AJ 
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III. LINEAR RESPONSE THEORY 



Evaluating linear response of a system to an external perturbation is essential ingredient of the fluctuation- 
dissipation theorems [TH, [l4| . For stochastic diffusion system, the conventional approach was based on the forward 
Fokker-Plank equation and applied the time-dependent perturbation theory p3l . l33l | . Here we show that the same 
results can be also achieved using the Kolmogorov backward equation. Our approach is not only relatively simple, 
but also its theoretical results are able to be extended to the later transient FTs naturally. 

Assuming a perturbed stochastic system having a Fokker-Planck operator C p — £ D (x, i) + £ (x, t), where C Q 
and £ e are unperturbed (denoted by the subscript "o") and perturbed (denoted by the subscript "p") components, 
respectively, and that the perturbation is applied at time 0. For the sake of generality, the unperturbed system may be 
stationary or nonstationary, and the type of perturbation is arbitrary. Further assuming the probability distribution 
functions of the unperturbed and perturbed systems be f (x,t) and f p (x,t), respectively. For a physical observable 
B(x), one may define its dynamic version i? p (t|x, t') by 

B p (t\x, f ) = J dx'B(x')p p (x', t\x, t') = x <*' (B(x(t))) p , (12) 

where p p is the transition probability density and B p (t\x, t) = B(x). Mean value of the observable at time t is then 
evaluated by 

<B) P (t)= J dxBp(t|x,0)/ p (x,0) = J dxB p (f|x,0)/„(x,0). (13) 

Obviously, the dynamic observable (fT2|l satisfies a backward equation analogous to Eq. ((4|) 

d t ,B p {t\x,t') = -L+(x,t')B p (t\x,t')-/:+(x,t')B p {t\x,t'), (14) 

where £<f is the adjoint operator of C c . The Chapman-Kolmogorov equation ([6]) also holds for the dynamic observable 
given by 

d t ,[J dxB p (t\x,t')f p (x,t')]=0. (15) 

Equation. (fT3|) may be regarded as a direct consequence of the above identity. 

The linear approximation solution of Eq. (fT4]) may be obtained by two approaches. The first one is to use the 
standard perturbation technique and to regard the last term in the equation as a small perturbation. We expand the 
dynamic observable to first order 

B p (t\x,t') = B (t\x,t') + B 1 (t\x,t') + --- , (16) 

and impose their final conditions B (t\x,t) = B(x) and Bi(t\x, t) = 0. Substituting it into Eq. pT|) . we obtain the 
zero and first order terms satisfying 

d t ,B {t\x,t') = -L+{x,t')B (t\x,t'), 

dt-B^tlxJ) = -£t(x,t')Bi(t\x,t') - Ct(x,t')B (t\x,t'), (17) 
respectively, and their solutions have path integral representations (e.g. Theorem 7.6 in Ref. [3~I1 ]) 

B (t\x,t>) = ^' {B(x(t))) o , (18) 
Bi^M') = ^'(J dTC+(x,T)B (t\x(r),T)) , (19) 

respectively. B a (t\x, t') is obviously the dynamic observable in the unperturbed system. Then the linear approximation 
of the mean of the observable is 

(B) p (t) = (B) (t)+ J dr J dxf (x,T)£t(x,r)B (t\x,T) + ■ ■ ■ 

= (B) (t)+ rdr([/- 1 £ c (/ )](r) J B(t)) o + .-. , (20) 
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where ( ) denotes the average over the trajectories starting from initial distribution function / Q (x, 0), and we used 
the adjoint characteristic of £ c in the second line. Then, we can obtain familiar response functions by substituting 
concrete perturbation expressions in the above equation. The second approach is more direct and interesting. Let us 
consider a "twisted" Chapman-Kolmogorov equation 

dt>[J dxB p (i|x,t')/o(x,t')] = - J dxf (x,t')£+B p (t\x,t'). (21) 

We must emphasize this is exact. Integrating both sides with respect to time t' from to t, we immediately see the left 
hand side is just the minus of the difference between the means of the observable in the perturbed and unperturbed 
systems. If the first order approximation was concerned about, namely, the subscript "p" is replaced by "o" on 
the right hand side of Eq. (f2"Tj) . we reobtain Eq. ([2U|) . Compared with conventional approaches on the basis of the 
forward equation, these two approaches here do not need time-ordering operator or interaction representation [35l. |36T| . 
Particularly, in our second approach we even do not need the time-dependent perturbation theory and path integral 
representation. 



A. Fluctuation-dissipation theorems 



The classical fluctuation-dissipation theorems state that the linear response function of an equilibrium system to a 
small perturbation is proportional to the two-point time-correlation function of the unperturbed system [l3l fl4j | . This 
topic is attrac ting considerable interest due to continuous efforts of extending the standard one to nonequilibrium 
region [35l l36l . l37l l38l . [39| . Here we briefly discuss two intriguing FDTs (38l . 133 ] in two typical physical models. In 
addition to preparing some definitions of two models that will be used in following sections, we want to show that, 
although these two theorems are nontrivial in physical interpretation, they may be regarded as simple applications of 
two general identities 

d x H,J>), K>f = 2[C(Ef)-E£(f) + (d Xi E)Ji(f)} (22) 
= £(Ef)-E£(f)+£+(E)f, (23) 

where E and / are arbitrary functions. They should be used in previous works. Interestingly, we find these two 
identities are still very useful in the transient FTs. There we will use a new identity derived from them 

£(Ef)=£ + (E)f-£(f)E-2d Xi [Ji(f)E}. (24) 



1. Overdamped Brownian motion 

Multidimensional overdamped Brownian motion is a typical example of the Smoluchowski type diffusions (40| , the 
SDE equation of which is simply 

dx = M(x, t)[-V?7(x, t) + F(x, t)]dt + B*(x, t)dW(t), (25) 

where F is a nonconservative additive force, the nonnegativc mobility and diffusion matrixes are related by 2M=/3B, 
and (3^ l ~k^T with Boltzmann constant and coordinate-independent environment temperature T. We assume 
perturbation is realized by adding a time-dependent potential —h(t)V(x) to the original one £/(x, t). Under this 
circumstance the perturbed component C c is —h(t')d Xi Mu(d Xl V). Substituting it into Eq. (|20|) . we obtain the response 
function 

R B (t,r) = S(B) p (t) /Sh(r)\ h=0 

= (f-\T)d x ,[foMud Xl V}(r)B(t)) o . (26) 

This expression seems very different from the standard FDT [l4| , even if the unperturbed system is in equilibrium. 
However, this difference is not intrinsic. Choosing £=-£ Q the Fokker-Planck operator of Eq. ([23)) and E=V(x., <'), and 
noticing that the left hand side of Eq. (|22|) is just 2£ e (fo)/h(t')(3, we obtain two new expressions of Eq. (|26|) given 

R B (t,r) = [3±(V(T)B(t)) ~f3({f- 1 Mf )d Xi V](T)B(t)) o (27) 
= § ^{V{r)B{t)) Q - | (£+(V)(r)B(t)) o . (28) 
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Although the FDT l[27|) still faces the difficulty of unknown probability distribution f a as Eq. (|26|) . it intuitively indi- 
cates that the responses are different for the unperturbed systems prepared in equilibrium and nonequilibrium states; 
the latter usually has nonvanishing probability current. In contrast, the FDT (|28|) does not need this distribution 
and is more useful in practical simulation or experiment. The second term on the right hand side was interpreted as 
a correlation with dynamical activity (39j . 



2. Under-damped Brownian motion 

The second model is one-dimension underdamped Brownian motion (no apparent differences in discussion for 
multidimensional case), 



dp = —d x Ho{x,p, t)dt + F(x, t)dt — ^^pdt + sjlm^lfi dW 
dx = d p 7io(x,p,t)dt. 

The deterministic Hamiltonian system is included by choosing 7o=0 and F=0. For convenience, we rewrite this SDE 
into a matrix form 

dr = n ■ VH dt + Fdt - T ■ Pdt + ^2mT/(3 dW, (29) 
where we define new vectors r T =[rx,r2]=[p, x], P T =[p, 0], F T =[F, 0], V T =[9 P , d x ], and matrixes 



n 



-I 

1 



7o 




(30) 



This is a typical example of Kramers type diffusions. According to the types of the perturbations, several different 
FDTs with specific conditions may be obtained. The relatively simple case is that the perturbation is still through a 
potential —h(t)V(x) and £ c =—h(t)(d x V)d p . We can of course obtain a FDT as Eq. ((26]) by directly substituting C c 
into Eq. (f20|) (not shown here). In addition, one may expect that the left hand side of Eq. (f22|) is still proportional 
£ e (/o) as that in the overdamped case. This is indeed true if choosing E=pd x V(x) and assuming 70 independent of 
spatial and momentum coordinates. We obtain 

Rb(!,t) = JL^ {{j>dxV) ^ W )) o -^([f^jmdrApd x V)]{r)B{t)) o (31) 
7oto dr 7oW 

= ^-±{(pd x V){r)B{t)) ^— (£i(pd x V)(r)B(t)) o . (32) 
270TO dr 270m N ' 

These new FDTs seem to be very different from Eqs. (|27| and (|28| in the overdamped case. For instance, Eq. (f3Tj) is 
not as good as Eq. (|27|) in concept because the current J ri are not zero even if the unperturbed system has canonical 
distribution [in equilibrium and F=0]. Particularly, these FDTs cannot automatically reduce to the standard FDT [l4| 
in deterministic Hamiltonian system by simply choosing 7o=0. These problems could be avoided if one notices the 
left hand side of Eq. (f2"2"|) vanishes for E=V(x) (the same consequence as vanishing 70) and introduces a modified 
current 

J(/ ) = J(/o) + r 1 nv/ . (33) 

Then we obtain another FDT given by 

R B (t,r) = p^(V{r)B{t)) -P{[f-\] r ^f )dr t V]{T)B{t)) . (34) 

This expression is the same as Eq. ([2"T]) . and the last term vanishes for an unperturbed system having canonical 
distribution. We must emphasize that Eq. (|34|) is suitable to the cases that 70 is any function of the coordinate r. 

For general perturbations that depend on spatial and momentum coordinates simultaneously, e.g. —h(t)V(x,p) [33j ] . 
the above FDTs usually do not hold. Considering simple case that 70 is time-dependent only. Equation (|34[) is then 
modified by an additional term 

+ Pm 10 {{[fo 1 Mfo) -p/m]d p V + r 1 d 2 p V}(T)B(t)) . (35) 

Finally, if we temporarily forget the time derivative in these previous FDTs, we can obtain a more concise FDT 

R B (t,T) = 13 ([A ri n a d ri V]{T)B(t)} - /3([/ - 1 J ri ^<9 ? ,^](r)B(t)) , (36) 

where matrix is — n(mr — II) -1 . One may easily prove that Eqs. (j3"6"]) and (|3"4"|) are identical if the potential V is a 
function of the spatial coordinate x only or vanishing 79. The above discussion about the FDTs in these two physical 
model are mainly technical. Their underlying physics may reference previous literature [38l |39| . 
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IV. GENERALIZED INTEGRAL FLUCTUATION THEOREM 



During the reinvestigation of the linear response on the basis of the backward equation, we notice that the Chapman- 
Kolmogorov equation (|15p plays a role implicitly. Particularly, we find that there are other functions or variants 
of B p (t\x, t') not only satisfying the same Chapman-Kolmogorov equation but also having the same mean of the 
observable, e.g., B' p (t\x,t') satisfying 

d t ,B' p (t\x,t') = -£+(x,i')B;(t|x,t') + f-\x,t')[f p £t - £ (/p)] (x,t')B' p (t\x,t') 

= -£+(x, t')B' p (t\x, t') - /- 1 £ e (/ p )(x, t')B' p (t\x, t') (37) 

with final condition £? p (i|x, t) = B(x). The proof is obvious if one employs the evolution equation in the first line and 
the adjoint characteristic of £ c and £+. Actually these operators could be arbitrary. Regarding the second term in 
the second line in Eq. (|37[) as a small perturbation and employing previous either approach, we will obtain Eq. (|20[) 
again. This discussion also leads into another interesting result. In physics the identification between the perturbed 
and unperturbed systems is some arbitrary. One may think of the unperturbed system as an oppositely perturbed 
consequence of the perturbed system, e.g. applying mechanic forces with opposite directions starting from time 0. 
This point is very clear in Eq. (|16|) . where Bi of course can be moved to the left hand side. Correspondingly, we have 
an equation about B' (t\x,t') that is a variation of the dynamic variable B Q (t\x, £') given by 

d t ,B' (t\x,t') = -£+(x,t')B^t\x,t') - f-Xx,t')[f £+ - £ e (f )] (x,t')B' (t\x,t') 

= -£+(x,f)fl;(t|x,t / ) + /o" 1 A(/o)(x,f)Bo(*|x,*') (38) 



with final condition B' (t\x,t) = B(x). One can obtain it as well from Eq. (|37[) by simply exchanging the subscripts 
"p" and "o" and changing the symbols before £+ and £ e into minus. Repeating previous evaluation, one obtains 
Eq. (|20p again. A more intriguing fact appears when we tried to prove the Chapman-Kolmogorov equation (|15[) for 
the function B' p using the evolution equation in the second line of Eq. (|3T[) : vanishing of the derivative with respect 
to time t' on the left hand side of Eq. fT5|) requires 

£ e (/p)(x,0 = [^-£o]/p(x,0- (39) 

It is obvious if we employ the forward equation for the distribution / p . But this point reminds us a general result: for 
an arbitrary probability distribution f(x,t) we can construct a function B(t\x, t') satisfying a perturbed backward 
equation 

d t ,B(t\x,t') = -C+(x,^)B(t\x,t') - f-\x,1?) [cV/ - £(/)] (x,t')B(t\x,t') (40) 
with final condition B'(t\x,i) = B(x), and this function satisfies 

d v [ / dxB(t\x, t')f(x, t')} = 0. (41) 



We call Eq. (|4"Tj) generalized Chapman-Kolmogorov equation because the functions therein may be beyond those in 
the standard one (fT5|) . Eq. ([4TJ]) is not yet the most general; one can still add new terms as those in Eq. (|3"T|) to obtain 
other equations, which will be seen shortly. 

So far we employed the perturbation technique to solve the backward equations (|14p and reobtained the linear 
response theory. Equations (|37p and (|38p seem unnecessary because they are not beyond the original one from 
the point of view of perturbation. However, the Feynman-Kac formula §§§ and generalized Chapman-Kolmogorov 
equation (|4ip provide us two nonperturbative relations: 

(fl)p(t) = (exp[/ /- 1 £ c (/ p )(r,x(r))dT]i?(x(i)))odT, (42) 



n 



(S)o(t) = (exp[- / /- 1 £ e (/ )(T,x(r))dr] J B(x(t))) p dT. (43) 



There is an analogous relation for Eq. (|40|) as well. We must emphasize that these relations are always correct formally 
and do not matter with the type of the perturbations. Particularly, Eqs. (|4"2")l and (|4"3"| reduce to the linear response 
formula (|20p when expanding their exponentials to the first order. In addition to the Feynman-Kac formula, we also 
notice that the Girsanov formula presents an alternative nonperturbative relation for the perturbation problem, 

(B) p (t) = (B(x(t))) p = (e- fo E ' a lW T »^(x(i))) , (44) 
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where a = hls/LS/V for the mechanical perturbation in the previous overdamped Brownian motion. At first sight, one 
may think of that Eq. (|4"4")) is superior to Eq. (142j) in that the latter does not need the unknown perturbed distribution 
function f p . However, Eq. (|44|) is based on the validity of the Girsanov formula. We have mentioned that this formula 
is not always true, e.g., the general perturbation in the underdamped Brownian motion; see a simple discussion in 
Appendix [a1 In contrast, Eq. (|42|) is robust. 

Both Eqs. (|42|) and (|44|) have to face a challenge whether they are really useful, which relies on whether they provide 
us new evaluation approaches or physical understanding about stochastic processes. It should be better to put this 
question into a more general situation, namely, whether Eq. (|40|) is useful or not. This is natural because Eqs. (|37p 
and (|38p are its specific cases. We have mentioned that even Eq. (|40|) has a more general variant, 



d t >B(t\x,t') 



-C+{^t')B{t\^t') - rW) [St'f ~ A/)] (x,t')B(t\x,t') 
+/- 1 (x,i') [C*(g)-g£+] (x,t')B(t\ X ,t'), 



(45) 



with final condition £?(t|x, t) = B(x), where g(x, t') is arbitrary smooth positive functions, the arbitrary operators 
C a and are adjoint each other. One may check that, under this case the generalized Chapman-Kolmogorov 
equation (|41[) is still true. Intriguingly, the two perturbed components in Eq. (|45[) have very distinct meanings for 
the generalized Chapman-Kolmogorov equation: the first in the first line is indispensable while the second in the 
second line is not. This point should be reflected in the physical explanations of the above equation. Rather than 
investigating very general C a , in this work we are interested in the simplest but nontrivial case: £ a and g(x, t) are 
chosen such that Eq. (|4"S")) is 



2/- 1 (x, t') [(d Xi Si)(x, t') + #(x, t')d Xz ] B(t|x, t') 



(46) 

where "• ■ •" represents the first line of Eq. (|45[) . and A^-dimension vector S={/f>j} satisfies natural boundary condi- 
tion. On the basis of the generalized Chapman-Kolmogorov equation, Feynman-Kac and limited Girsanov formulas 
(Appendix Ia^) . for a certain vector S whose last (N—M) components vanish, we obtain an identity 



( e -JSW&Wr)>r)*rB[x(t)]) = (B)(t), 

where the integrand is 

J[f,S] = r 1 [(C-d T )f + 2d Xi S t }+TZ[-2f- 1 S} 

= r 1 [(£ -d T )f + 2d Xi Si + 2f- 1 S l (B~ 1 ) u S l ] + 2f- 1 S, l (B- 1 ) tt («, - A,) , 

the inverse of B is formally defined by 



B 



D- 1 





(47) 



(48) 



(49) 



the mean on the left hand side is over the trajectories starting from initial distribution function /(x, 0) and determined 
by the stochastic process (JlJ, and the mean on the right hand side denotes the average over distribution /(x, t). We 
call Eq (|47p generalized integral fluctuation theorem, which is obviously more general than previous version that was 
limited to the Smoluchowski type diffusions (29[. Noting time in the GIFT may be replaced by any time t' (<t) and 
correspondingly the average on the left hand side is over /(x, t'). 



A. GIFT and time reversal 



As mentioned at the very beginning, the backward equation has a natural connection with time reversal. A naive 
understanding about it may define a reversed time s=t—t' (0< t'< t) and convert the backward equation into initial 
value problem. This would be useful when applying ordinary numerical approaches to the unusual final value problem. 
However, the situation is more delicate about time reversal of Eq. (|46|) . Multiplying both sides of the equation by 
/(x, t') and performing a simple reorganization, we obtain 

fi t /[fl(*[x,t')/(x > 0] = -!(^t')C + B(t\^t') + C(f)(x,t')B(t\x,t') +2d Xi [Si(x,t')B(t\x,t')]. (50) 

Compared with Eq. ([24]) . we see that, if choosing Si to be the probability current Ji(f) the right hand side becomes 
— C[B(t\x, t')/(x, t')]. Using the new time parameter s rather than t', we then obtain a time reversed Fokker-Planck 
equation for function B(t\x, i')/(x, t') and the Fokker-Planck operator is simple £(x, t — s). This argument was further 
generalized to the case with even and odd variables x under time reversal [29j . Because the stochastic process ([T]) here 
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is more general than previous one, and time reversal is very important in following discussions, e.g. the derivation of 
transient DFT, we briefly recall some definitions and main results. 

Coordinates Xi of stochastic system may be even or odd, according to their rules under time reversal: if Xi^+Xi 
is even and Xi—>—Xi is odd, e.g., momentum in Eq. (|29p ; in abbreviation Xi^>Xi=EiXi and Si= ±1. The drift vector 
splits into "irreversible" and "reversible" parts, A = A irr + A rov . Under a time reversal, we assume these vectors are 
transformed into A = A lrr + A rcv , where 

ij rr (x,0 = (51) 

irw) = -Mr (*>«)■ (52) 

Such a splitting may be arbitrary or a prior known. Additionally, the transformation of the diffusion matrix is also 
given by 

B«(x,t / ) = emBui^s). (53) 

No summation over repeated indices here. These transformations are actually an inhomogeneous extension of homo- 
geneous diffusion case [23|, [HJ . Considering a time reversed forward Fokker-Planck equation with above new defined 
drift vector and diffusion matrix, 

d s p(Z,s) = £ R (x, s)jj(x, s) = [-d £i Ai(5c,s) + ^d £i ds j B ij (5i,s)]p(x,s). (54) 
Substituting a decomposition 

p(x, s) = [J 6(t|x', i)/(x', t)dx']- 1 6(t|x > tO/(x, 0, (55) 

where 6(t|x, t)=B(x), /(x, t') is an arbitrary normalized positive function, and the prefactor ensures p(x, 0) to be 
normalized, and a performing simple evaluation, we can rewrite Eq. (|54|) as 



d t 'b(t\x,t') = -£+{x,t')b(t\x,1?) - f-\x,t')[dff -£(/)] (x,t>(*|x.O (56) 
+2/- 1 (x,t') [(fl^(/))+<Sf (/)&«] 6(t|x,0, 
where we define an irreversible probability current on the function / 

SH/) = 4"(x,i')/(x,t') - \d Xl {Buf)(^t'). (57) 

Hence, if vector S in Eq. (|46|) equals the irreversible current, the time reversal explanation of the equation is just 
Eq. (|54p . Moreover, this explanation is still valid even in case of general S. One may easily see it by constructing a 
specific splitting 

4 rr (x,i|/,S) = /- 1 (x,i)[S i (x,i) + ^,(B u /)(x,t)], (58) 
Ar(x,t|/,S) = MK,t)-Af r (yi,t\f,S). (59) 

Obviously, S is just the irreversible probability current defined by the above irreversible drift on function /, which 
we denote S lrr (/|S,/) in the following. We must emphasize that such a splitting might be not real in physics. 
The relationship between Eqs. I|46p and (|54[) presents an alternative understanding of the generalized Chapman- 
Kolmogorov equation (fITj) : the spatial integral of its left hand side is proportional to the total probability of p(x, s) 
that is time-invariable according to the forward equation (|54[) . It is worth emphasizing that the above conclusions 
do not matter with the characteristics of the diffusion matrix (degenerate or nondegenerate) . We believe that we 
should not be the first to obtain Eq. (p)6")) . This equation might be derived earlier in finding the conditions on the 
diffusion matrix and drift vector for a time-reversible homogeneous Fokker-Planck equation (A lrr =A lrr , A rev =A rov , 
and B=B) to have stationary equilibrium solution / cq (x) that satisfies the detailed balance principle [ID, [24|. We see 
these conditions are identical to the requirement that /=/ eq (x) and the other terms except for C + on the right hand 
side of Eq. ([56]) vanish, respectively. 
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B. GIFT and integral transient fluctuation theorems 

Although the GIFT (|4T[) is always correct in mathematics, their physical meaning and applications in practice are 
not very obvious given very general / and S. These problems might be answered better by choosing familiar functions 
with explicit physical meaning, e.g.. probability distribution function and irreversible probability current of stochastic 
system, or choosing very simple expressions. We have briefly reported that [2{|, under some specific choices the GIFT 
reduced to existing several IFTs 0, 0, [HJ EL E2]- Here we present detailed evaluations, and particularly we add 
the results about the Kramers diffusion and the new IFTs (j42]l and (|43|) . One will see the GIFT actually provides a 
simple and clear way to classify these IFTs. 



1. S = S lrr (/) with natural splitting 



If we prior know a splitting of the drift vector, this may be the most natural consideration. In the derivation of 
Eq. (|56|) from the time reversed Fokker-Planck equation (|54|) . function /(x, t') in the decomposition (|55|) is almost 
arbitrary. One may specify a decomposition p(y, s)ocl x b^(t\x.,t') and the new function 6(i)(i|x, t') still satisfies 
Eq. (|56p except for / = 1 therein. Because of the same p(y, s), these two decomposition has a simple connection, 

H ,|, ^JHW^ . (60) 

f(x,t') J b {1) {t\x',t)dx' 

This result immediately results into a relationship between the functionals (|47[) of the path integral representations 
of b(t\x,0) and 6i(i|x,0): 



J[f,S™(f)}(x(r),T)dT = -In H*®'t) + , J(i)(x(r)iT)dr 



V[/ J S-(/)]( X (r),r)^ = -ln^| 

where i7(i)=c7[lj S lrr (l)], and the term ln/(x(f), t) is from the final condition &(i)(i|x, 0). Given a prior known splitting 
A = A lrr + A rcv and performing a simple evaluation, the new function has an expression 

= 2Af{B- 1 )u(vi-AD-d Xi AT" (S), (62) 

where A 1 " = A 1 " —d X[ Bu/2, and letter "S" in the second line denotes that time integral of this equation is Stratonovich 
integral [4l|. Compared with the original one, function &i(t|x, t') is distinctive because its functional is completely 
determined by intrinsic characteristics of the system and environment, including the drift vector and diffusion matrix. 
Moreover, the above functional identity (|6ip definitely states that, for any pair of functions having the same expressions 
at times and tjtheir GIFTs under this consideration are completely identical. An analogous expression was obtained 
earlier in Ref. [22| [Eq. (7.5) therein] by using an abstract time reversal argument. We may emphasize that Eq. (j6"2")) 
is more general than the previous one, because it also accounts for Kramers diffusion, which is seen shortly. 

Equation (|62p has simpler expressions for the two physical models in Sec. IIIII For the overdamped Brownian 
motion (|25[) with even variables only (e^=+), a conventional splitting is 

A irr = A(x, t), A rcv (x, t) = 0. (63) 

Then we have S lrl '(/)=J(/). The time reversal of this splitting was called reversed protocol (25|. Correspondingly, if 
the mobility matrix and the environment temperature are constant, is simply 

p(-d Xi U + *Fi)vi (64) 



Another example is the underdamped Brownian motion (|29p . Different from the overdamped case, this model has 
even spatial coordinate and odd momentum coordinate. For a simple Hamiltonian TLq = p 2 /2m + U(x,t), we have a 
canonical splitting 

A ilT (r,i) = -r-P, A rcv (r,t) = 11- VH Q + F. (65) 
Then p-component of the irreversible current on function / is 

S l "(f) = -joPf(p,x)-d p [p- 1 m 10 f(p,x)}, (66) 
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and x-component S'"' r (/) vanishes. Therefore, the condition for the GIFT (|47|) with degenerate diffusion matrix is 
satisfied. Under an assumption of constant friction coefficient and environment temperature, J^n\ is simplified into 

rj in/I) 

-P-^( — ) + f3(-d x U + F)v. (67) 

We see the overdamped result (|M]) can be obtained by letting m = in the above equation. If the temperature is a 
function of spatial coordinate, one may easily check that the time integral of J{\\ is Eq. (6.12) in Ref. Q that was 
called entropy flow from the system to environment along a trajector y. 

The physical meaning of functional (f6~Tj) has been well understood [id [Til . [22j : If the function / is the probability 
distribution function p(x, t) of the stochastic system, the first and second terms are the Gibbs entropy production of the 
system and the entropy production in environment along a stochastic trajectory between times and t, respectively. 
Hence the GIFT (|47|) under this consideration is the IFT of the overall entropy production given a specific splitting. 
This theorem also presents that, for a diffusion process the mean overall entropy production of stochastic system is 
always nonnegative (the second law of thermodynamics). This point may be seen by directly using Jensen inequality 
to the GIFT with B=l or evaluating the mean instantaneous rate of overall entropy production, the latter of which 
is 

{J[p, S irr (p)]) = 2 j dxp-'STipKB-^uSrip) > 0. (68) 

Noticing that the other terms in Eq. (|48|) all vanish after ensemble average (the last term due to the definition of Ito 
integral [11]). Noting Eq. (|S"8j) also holds for any vector S with natural boundary condition. 



2. Vanishing S with posterior splitting 



For an arbitrary vector S, the above results (f6T)|) - (f6"2")) are still correct except that they are about B(t\x,t') and 
S,x, t') and their functionals, where the decomposition p(y, s) ocl x £?(i)(i|/, S,x, if). Significantly different 
from previous case, both Br^ and its J7(i) depend on / and S through the splitting (|58[) . Because such a splitting is 
defined under these given functions, we roughly call it posterior. Rather than discussing a general vector, we focus 
on the simplest case S = 0. Correspondingly, the splitting is 

4"'(x,i|/) = ^j^-d Xl (B u f)(x,t), Ar(x,t\f) = A(x,i)-A irr (x,t|/). (69) 

Substituting them into Eq. (|6"2"|) , we obtain 

J {l) [f] = f-\C + v i d Xi )f (S). (70) 
The same result can be achieved simply by employing the relation d/dr=d T +Vid Xi and 

j[f,o]=r i (c-d T )f. (71) 

Equation (|69[) shows a posterior splitting is usually /-dependence. But there is an intriguing exception if the drift 
vector and diffusion matrix of a stochastic system satisfy the detail balance conditions when time parameters in them 
are fixed. Such a system has a transient equilibrium solution 

£(x, i)/ cq (x, t) = 0, S irr (/ oq ) = 0. (72) 

and this solution has a simple Boltzmann distribution. For instance, in the models (|25j) and (|29p with constant 
mobility matrix and friction coefficient, if nonconservative forces there vanish, such solutions indeed exist and / cq oc 
exp[— PU] and oc exp[— /3(p 2 /2m + U)], respectively. Hence, if we choose / = / cq (x, t), the splitting ((69)) is no longer 
/-dependent and Eq. (|TTj) becomes 

^7 E [/° q ,0] = -d T \n.n. (73) 

The time integral of the above equation was called the dissipated work. One easily sees that, under this case the 
GIFT P7|) with B=l and S(x — z) are the celebrated Jarzynski equality @, 0] and the key Eq. (4) in the Hummer 
and Szabao's work [l6l |. respectively. Although the splitting here is the same with the natural splitting we discussed 
previously, we must point out that, in the Jarzynski equality, stochastic trajectories start from an initial equilibrium 
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distribution. In contrast, the IFT of the overall entropy production is valid for any initial distribution besides 
equilibrium state and even in the presence of nonconservative forces. 

A famous example of virtually /-dependent splitting in the literature is for the stochastic system having transient 
nonequilibrium steady-state [12j, 

£(x,t)/"(x,t)=0, J(.D^0. (74) 

e.g., nonconservative forces nonzero in the models (1251) and (|29|) . The time reversal corresponding the splitting of 
Eq. |69|) with f=f ss was also called current reversal [22(. Equation (jTTj) under this case becomes 

JHs[/ ss ,0]--a T ln/ ss . (75) 

We see it is almost the same with Eq. ([75)1 thought their splitting or time reversals are completely different. The 
time integral of the above equation was called the excess heat or entropy production and the GIFT with B=l is the 
Hatano-Sasa equality [l2l |. Noting stochastic trajectories of this theorem start from a nonequilibrium steady-state. 

In addition to the above two well-known IFTs, Eq. (f7Tj) also reveals several simpler IFTs with vanishing S. The 
most obvious case is to choose f=p the distribution function of the stochastic system itself and J[p,Q\=0 simply. 
Correspondingly, Eq. (|46|) reduces to the standard Kolmogorov backward equation ((4|) and now the GIFT (|47|) is 
trivially the path integral representation of the standard Chapman-Kolmogorov equation also see Eqs (fl"2"|) . JT3]) 
and (fT5|) . The splitting or time reversal ([69)) in this case was called complete reversal [22| . The other IFTs are relevant 
to the perturbation problem in Sec. (jllip . We choose the stochastic systems to be the unperturbed one C=C and 
f=f p or the perturbed one C=C p and f—f Q as discussed previously, Eq. (fTTj) then becomes 

J[f P ,0] = fp'^o - d T f p ) = -/- 1 £ c (/ p ), (76) 
J[foA = fo\C p - drf ) = f-'Ceifo), (77) 

respectively. We immediately see that the corresponding GIFTs are Eqs. (|4"2"j) and (|4lS|) , respectively. Although 
these identities look very similar, their time reversals definition are significantly different. Let us consider a simple 
situation that the unperturbed system is in equilibrium /° q (x) and the perturbation A e (x, t) is imposed on the drift 
vector A (x)=Aq GV (x) + A" r (x) as usual. Obviously, for the case C—C a and /=/ p , the posterior splitting (|6"5|) is 
/p-dependence. We usually do not know their concrete expressions due to the unknown f p . On the contrary, for the 
case C=C P and f=f , because /o q ( x ) satisfies the detailed balance condition, Eq. (|69|) is simply 

A irr (x,i|/ ) = A* r (x), A rev (x,t|/ ) = A7(x) + A c (x,i). (78) 

This is a new example with vanishing S and /-independent time reversal particularly. Whatever the perturbation 
is reversible or irreversible in physics, it is always classified into the reversible drift in the time reversed system £r. 
This point is interesting for physical model with vanishing A™, e.g., the overdamped Brownian motion (|25[) with 
vanishing nonconservative force. 

Different from Eq. (|68p . because function / is usually not identical to system's real distribution function p(jc,t), 
we cannot interpret the ensemble average of Eq. (|7ip as mean instantaneous rate of overall entropy production (|68p , 
though it is always nonnegative (Jensen inequality). However, they are indeed connected by the following relation, 

f J[f, S irr (/I/) = 0](x(r), r)dr = In + f J[p, S irr (/ J |/)](x(r), r)dr, (79) 

where we have assumed / and p have the same distribution at time 0, the functional on the right hand side is for the 
new function D defined by a decomposition p(y, s)oc p(x, t')D(t\x, t'). We must emphasize that both the time reversed 
Fokkcr-Planck equation for p(y, s) and the irreversible probability current on the system's distribution function p here 
are constructed by the posterior splitting (|69|) . Equation ([79]) can be easily proved on the basis of Eq. (j6Tj) . Averaging 
both sides of the above equation with respect to the distribution function p, we see that the second term on the right 
hand side is the mean overall entropy production during a fixed time t given the specific splitting (|69[) . and the first 
term is the relative entropy between the two distributions p and / at time t, which is always nonnegative. Hence 
we call the left hand side of Eq. (|T9"|) overall relative entropy production functional [12] ■ We may point out that the 
above results are also suitable to the cases with nonzero S, e.g., see Eq. (|82[) below. 

C. Girsanov equality 

Recalling Eq. (|4"5| . one may notice that any ensemble average of the term / _1 Si(B~ 1 )uSi is always non-negative 
due to the semipositive definite diffusion matrix B. In fact, this observation has alternative indirect explanation. 
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Considering a perturbed forward Fokker-Plank equation 

d t p' = £'(x, t) P ' = £(x, t) P ' + 2d Xi [/^(x, i)5i(x, ty (x, t)]. (80) 
Employing the limited Girsanov formula, we obtain an identity 

( e - /o K[-2/- 1 S](x(r).r)dr B(x ^ ) ^ = (^'(^ ( 8 1) 



and previous Eq. (|44|) is its specific case. We call the above equation with B = 1 Girsanov equality. Speck and Scifcrt 
first obtained such type of equality in a specific case with S=J(/ SS ) and f~f ss the transient steady-state defined in 
Eq. (f?4")) [42| . Jensen inequality indicates the ensemble average of the functional of the equality is nonnegative. It 
is worth emphasizing that Eq. (|8Tj) is related to the standard Chapman-Kolmogorov equation ([7]) rather than the 
generalized one (j4Tj). This point can be seen from the fact that the means of both sides are respectively over p'(x, 0) 
and p'(x,t) rather than / functions in the GIFT (|47j) . This analysis also reminds us an interesting relation given the 
vector S divergenceless: 

f J[f,S}(x(r),T)dr = ln^M+ f J[p, &"(p\f, S)](x(r), r)dr (82) 
Jo /( X W>*) Jo 

= f J-[/,0](x(r),r)dr+ / ^[-2/- 1 S](x(r), r)dr. 
Jo Jo 

The first line is the version of Eq. (|79[) for nonzero S, and the condition p(x, 0)=/(x, 0) was assumed. It is not 
difficult to find a nontrivial divergenceless vector, e.g., J(/ ss ) in the overdamped Brownian motion (|25J) with nonzero 
time-dependent nonconservative force, which was also the case investigated by Speck and Seifert [42|. Under this 
consideration, choosing / the transient steady-state and further assuming the stochastic system to be in noncquilibrium 
steady states / ss (x, t) at t, we find the first line is just the overall entropy production functional of the system, and 
the first term in the second line is the excess heat or entropy production functional (|75|) . Hence the last term was 
called housekeeping heat functional to consist with steady-state thermodynamics (43|. 

V. TRANSIENT DETAILED FLUCTUATION THEOREM 

The path integral representation of the solution of Eq. (|4l)|) presents a relationship between B(t\x,t') with general 
final condition and the one B(x 2 , i 2 |xi, ti) with specific final condition S(xi — X2), which is simply 



B(t|xi,ii) = f dx 2 (exp[- f JdT]5(x{t 2 ) - x 2 ) x cxp[- f Jdr)B(x(t))) 
J Jt t Jt 2 

= J dx 2 fl(f|x2,ta)fl(xa,t 2 |xi,*i). (83) 

In the first line we inserted a (5-function at time ti between times t\ and t, and the second line is a consequence of 
Markovian property. One may see this relationship is analogous to the Chapman-Kolmogorov equation ([S]), and a 
forward equation for i?(x 2 , £2|xi, ti) can be easily derived. On the other hand, the probability distribution function 
of the time-reversed Eq. (|54[) at time si=t—t\ can be constructed by the distribution function at earlier time S2=t—ti 
given the transition probability j>r, 

p(xi,si)= J p- R (x-i,si\x2,s 2 )p{x.2,s 2 )dx.2- (84) 

On the basis of Eq. and a comparison between Eqs. and (|54"|) . we obtain 

Pr(xi, si|x 2 , s 2 )/(x 2 ,i 2 ) = S(x 2 ,i 2 |xi,*i)/(xi,ii). (85) 

Here we used symbol BQ in Eq. (|4"o| rather than b() to indicate the generality of this identity. For a time-reversible 
homogeneous stochastic system that was mentioned previously, if we choose S = S IIT (/) and / = / cq (x), both the 
transition probability pr(x, t\x' , t') (t > t') of the time-reversed system (fM)) and B(x, t\x' , t') defined here are identical 
with the transition probability p(x, t\x' , t') in Eq. ([3]). Under this consideration, the above identity is just the principle 
of detail balance written in terms of conditional probabilities [2^, [24| . An analogous expression has been obtained 
earlier in Ref. [22j [Eq. (7.15) therein] and was called generalized detailed balance relation. We may point out that, 
compared with previous one the validity of Eq. (|85p is larger. 
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Now we consider an ensemble average of a (k + l)-point function over the time-reversed system (|54|) . 

(G[x(s ), • • • ,x(s fc )]) R = / p R (xo,so|xi,si)---p R (x fc _i,Sfc_i|xfc,Sfc)/(x fc ,t)G(x ,--- , x fe ) dxj, (86) 

J n 



where Sk=t—tk, t=so>s\>- ■ ■ >Sfc=0, and we chose the initial distribution p(x,0)=f(ex,t). Employing Eq. Ij85|) 
repeatedly, the right hand side of the above equation becomes 

/ B(xfc,tfc|xfc_i,tfc_i) • • •B(xi,ti|x ,to)/(xo,to)G(xo, • • • ,x fe ) J|dx.j, (87) 

o 

Remarkably, letting k—+oo the function G becomes a functional over the space of all stochastic trajectories. Wc 
then obtain a very general identity 

(g}n = (Ge-f« J ^ s ^ T ^ dT }, (88) 

where £/[{x(,s)}] = Q[{ex(t — s)}]. Obviously, choosing Q to be a terminal function £?(x(i)), we obtain the GIFT (147|) . 
Another important choice of the functional is 

6(h-£ t [{x(r)}})=6(h- [ J[f,S](x(r),T)dT). (89) 

Jo 

Employing Eqs. (|51|) and it is easy to prove that the overall relative entropy production functional £ 4 [{x(t)}] 
has the following property, 

rt 



£t[{5t(s)}} = I J[f,S}(eSc(t- s ),s)ds 
Jo 



= -{- In + (S) J^\2^(B-%(i 3 - Af v ) - d Zi Ar}(x(s),s)ds}, (90) 

where Vj = dxj/ds. Recalling the initial distribution of the time- reversed process that was defined in Eq. (|86p . the 
right hand side of the second line is just the minus of the overall relative entropy production functional £ t R [{x(s)jl in 
the time-reversed system. This observation could be derived by the involutive property of time reversal as well [221 ] . 
Substituting the functional (|8"9"|) into Eq. (T551) . we obtain the transient DFT @,@] 

P R (-h) = P(h)e' h , (91) 

where P R (/i) is the probability density for the stochastic variable £ t R =/i achieved from the reversed process with 
the specific initial distribution mentioned above, and P(h) is the probability density for £t=h achieved from the 
forward process Q. 



VI. CONCLUSION 



In this work, we have tried to unify the derivations of the linear response theory and the transient fluctuation 
theorems using the perturbed Kolmogorov backward equations from a backward point of view. The motivation of 
this reinvestigation of the linear response theory is that conventional approach of the theory is based on the forward 
Fokkcr-Planck equation and time-dependent perturbation, which is not used in the FTs evaluations. Our results 
show that, a derivation using the backward equation could be very simple and flexible even if unperturbed system is 
non-stationary. Importantly, this study also reminds us that the time-invariable integral identity we found previously 
is the generalization of the well-known Chapman-Kolmogorov equation. One may notice that our evaluations 
heavily depend on the path integral representation of the perturbed Kolmogorov backward equations. Only in this 
representation, the physical relevances of these partial differential equations appear explicitly. This situation is very 
analogous to the relationship between the Schrodingcr equation and Feynman path integral in quantum physics. 
Hence one might criticize that these perturbed backward equations are unnecessary because all above results could 
be evaluated by direct path integral approach. This point is of course correct in principle. However, as mentioned at 
the very beginning, such a "bottom-up" idea needs the known time reversal or splitting of the drift vector. Except 
for very simple or intuitive cases, e.g., those considered in Sec. IIV B 11 finding a meaningful time reversal or splitting 
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is not trivial task. It would be desirable if there are some rules or guides for this task. We think that these perturbed 
backward equations satisfy this demand; see Eq. (|58[) . This is also logical. After all, the FTs are identities of ensemble 
statistic properties of stochastic processes. In a word, the roles played by these perturbed backward equations and 
their path integral representations are complementary in the study of the FTs. Considering that the generalized 
Chapman-Kolmogorov equation is the cornerstone of this work, which intrinsically arises from the Markovian 
characteristic of diffusion processes, we believe that the evaluations and results developed here should be also avail- 
able to other Markovian stochastic processes, e.g. general discrete jump processes with continuous (30l | or discrete time. 

This work was supported in part by Tsinghua Basic Research Foundation and by the National Science Foundation of 
China under Grants No. 10547002 and No. 10704045. 



APPENDIX A: LIMITED GIRSANOV FORMULA FOR DEGENERATED DIFFUSION MATRIX 

As that shown in Eq. (fTTj) . the Girsanov formula requires the diffusion matrix to be positive definite. This point may 
be better a ppr eciated by first writing out the probability density of a stochastic trajectory {x(r)} in the stochastic 
system [27|: 

JV . M t M 

= [IJ H*k~ M)] / Y[v[ m ] exp[-- / THtHda] - M - <P^)m], (Al) 

fc>M 1 ^° i=l 

where rji = dWi/ds is standard white noises, and S- functions should be understood a product of a sequence of terms 
on all times between and t. The expression in the first square brackets on the right hand side indicates that noises 
only act on the first M coordinates. Assuming another stochastic system (denoted by prime) has a different drift 
vector A'=A+a. Obviously, if there is any nonzero component (k>M), the S functions in the first square brackets 
makes the ratio of the two probability densities of the same trajectory in these two systems meaningless. In physics 
this means we never observe the same trajectory in these two systems. We met such a situation in the discussion of 
the FDTs of the underdamped Brownian motion with the general perturbations (44j; see sec. HII A~2l On the contrary, 
if nonzero components of a are restricted to first M, namely, ak—0 (k>M), the ratio or Radon-Nikodym derivative 
of these two probability densities can be always established and is 

P'[{x(r)}] = P[{x(t)}]<T £ Ki*](TMr))dT^ (A2) 

where the integrand is the same with Eq. (jlip except that the diffusion matrix B there is replaced by the positive 
submatrix D and the summations are restricted to first M component. We call Eq. (|A2[) limited Girsanov formula 
to distinguish with the standard one. We may conveniently rewrite this limited formula into the standard one by 
formally defining the inverse of the diffusion matrix B; see Eq. (|49p if we bear in mind the application condition. 
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